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Abstract:- In this study, trans -consonantal vowel-to-vowel coarticulation in Chinese is investigated. The target 
words are in the form of 'bVl.ba', and the subjects are eight native speakers of standard Chinese. Vowel 
formants are examined at the onset, middle and offset points of the target vowel. Results show that trans - 
segmental coarticulation exists in Chinese, especially for the second formant value of the vowel, and in Chinese, 
coarticulatory effect does not extend to the offset point of the vowel. 
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I. INTRODUCTION 

This study deals with the extent of trans -consonantal vowel-to- vowel coarticulation in bi-syllabic 
words in Chinese. Coarticulation refers to the fact that speech sounds are not produced as isolated gestures, but 
are superimposed in a complex, context-dependent fashion. While it is clear that all natural speech is 
coarticulated, the magnitude and temporal extent of coarticulation in different contexts has been difficult to 
explain. The need to adequately describe the varieties of coarticulation has been a key factor driving the 
development of competing models of speech production and also perception [1]. 

It is shown from researches on speech production that there are systematic differences between 
languages in the spatiotemporal characteristics of coarticulation [2]. Ohman [3] compared the coarticulatory 
effects in three languages, and found that the F 2 values of target vowels varied more in English and Swedish 
than in Russian, due to vowel context. He attributed the coarticulatory differences to the languages' consonant 
systems, arguing that the requirements on the tongue body imposed by contrastive palatalization in Russian 
restricted trans -consonantal coarticulation. 

Now that the effect of transconsonantal vowel-to-vowel coarticulation is diversed among languages, 
researchers have sought to understand how different factors influence this effect, and it is shown that a 
language's system of vowel contrasts may influence V-to-V effect. Beddor et al. [4] conducted three 
experiments to test the hypothesis that V-to-V coarticulatory organization differs in Shona and English. An 
acoustic study of Shona and English trisyllables shows that the two languages differ in the coarticulatory effect 
of stressed and unstressed vowels on each other, and the relation between the production and perception data 
suggests that listeners are attuned to native-language coarticulatory patterns. 

Research results show that languages with larger vowel systems tend to exhibit weaker V-to-V 
coarticulatory effects than those with smaller systems. Weaker effects have been shown for English compared to 
the much smaller five-vowel systems of Shona, Swahili [5]. Cho [6] examined how the degree of vowel-to- 
vowel coarticulation varies as a function of prosodic factors, and results show that vowels in prosodically 
stronger locations are coarticulated less with neighboring vowels, but do not exert a stronger influence on the 
articulation of neighboring vowels. An examination of the relationship between coarticulation and duration 
reveals that accent -induced coarticulatory variation cannot be attributed to a duration factor, and some of the 
data with respect to boundary effects may be accounted for by the duration factor. He proposed that prosodically 
conditioned V-to-V coarticulatory reduction is another type of strengthening that occurs in prosodically strong 
locations. The prosodically driven coarticulatory pattern can be taken as part of the phonetic signaling of the 
hierarchically nested structure of prosody. 

In regard to the extent of coarticulation, studies on various languages have shown vowel-to-vowel 
coarticulatory effects not only in transitions, but extending into the steady-state period of the transconsonantal 
vowel both in palatographic data [7-8] and in acoustic data [9-10]. While there is ample evidence of the 
existence of vowel-to-vowel coarticulatory effects, factors have been cited which affect the extent of these 
effects. For instance, these effects may be constrained by intervocalic palatals and velars, whose production 
requires use of the tongue body in conflict with the production of vowels, thereby restricting vowel-to-vowel 
coarticulation [11]. 

It is indicated in early studies that V-to-V coarticulation might be a relatively local phenomenon [12], 
subsequent work has shown that this is not always the case. Instances of long-distance coarticulation, involving 
effects crossing two or more intervening segments have been found [13-14]. Magen [15] analyzed [bVbsbVb] 
sequences produced by four English speakers and found evidence of coarticulatory effects between the first and 
final vowel, meaning that such effects can cross foot boundaries and multiple syllable boundaries. More 
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recently, Grosvald [16] studied long-distance vowel-to-vowel coarticulation in English, and found that 
anticipatory V-to-V coarticulation can occur over at least three vowels' distance in natural discourse, and that 
even such long-distance effects can be perceived by some listeners. 

There has been some research work on the coarticulation of segments in Chinese. Wu and Sun [17] 
analyzed the acoustic coarticulatory patterns of voiceless fricatives. The fricatives are designed in CVCV 
contexts in which three peripheral vowels are combined with the fricatives in d and C 2 . Acoustic data, 
including frequencies and durations of the lower margins, concentration bars, vowel formants and the onset and 
offset transitions, are measured. It is found that there are three types of coarticulation effects: homorganic, 
heterorganic and contiguous. 

Yan [18] studied the vowel formant pattern and the coarticulation effect in the voiceless stop onset 
monosyllables, and it is found that there is no effect of tones on formant values of the vowel. However, there is 
significant effect of aspiration on the formant values at the onset point of the vowel. Chen [19] investigated the 
intersyllabic anticipatory coarticulation in CVCV sequences, and found that there is no effect of articulation 
manner on the formant values. There is coarticulatory effect of C 2 on V b as well as V 2 on V^ Sun [20] analyzed 
the coarticulation effect of vowels in read speech, and found that, as the speaking rate increases, the deviation of 
the vowel formant is also magnified, as is shown in the apparent centralization of the vowel. In fast speech, the 
fromant values of HI and Iwl are significantly different, due to the frontness and the backness of the following 
consonants. Besides, there is also the study on the anticipatory coarticulation in V1#C2V2 sequences [21], and 
coarticulatory effect in VCV sequences [22]. However, as far as we know, there have no studies on 
coarticulatory effect at different vowel points in Chinese. 

The present study will investigate the effect of trans-segmental coarticulation in Chinese. In particular, 
it will try to answer the following questions. Does trans-segmental coarticulation occur in Chinese? What is the 
extent of coarticulation in Chinese? Coarticulation may be classified as carry-over (left-to-right) or anticipatory 
(right-to-left) ones, and the present study will focus on carry-over coarticulation. 

II. METHODOLOGY 

A. Speakers and stimuli 

The speakers for this experiment were eight native speakers of Standard Chinese, four males and four 
females. The stimulus list is comprised of 6 stimuli, embedded in three carrier sentences of Chinese, each 
containing an item from a set of two target words. The target words are 'Biba' and 'Baba', which are supposed 
to be two persons' names. They are in the form of 'bV L ba', with /a/ as the target vowel, and Vi providing the 
'changing' vowel contexts, namely, the contexts of /a/ and HI. 

Labial lb/ is chosen to minimize the effects of consonant articulation on lingual articulations [3]. 
Within the carrier sentences, the target item is located at the sentence medial position. One example is shown as 
follow, 

Zhe shi Biba de jiejie. 
This is Biba's sister. 

B. Procedure and measurements 

The speakers were asked to read the sentences three times, in random order for each time, in normal 
speed, so each speaker produced 18 tokens: 6 sentences x 3 repetitions. In total, 144 tokens were acoustically 
analyzed (18 tokens x 8 speakers). 

This study aims at investigating the extent of V-to-V coarticulation in VCV sequences, and vowel 
formants were examined. Formant values were extracted using Praat [23], and the extent of trans-consonantal 
coarticulation was analyzed by comparing the formant values of the target vowel at three points: the onset point, 
the middle point and the offset point. That is, formant values at the onset, middle and offset points of the target 
vowel /a/ were extracted, and the values at different vowel contexts were compared. As is mentioned in the 
previous subsection, there are two contexts for the target vowel, HI and /a/. Coarticulatory effect exists if there is 
significant difference between the formant values in the two vowel contexts. On the contrary, there is no 
coarticulatory effect if there is no significant difference. A repeated measures ANOVA was performed for the 
comparison, and statistic analysis was done in SPSS. 

Figure 1 displays the waveforms and formant contours of the two key words 'Biba' (a) and 'Baba' (b). 
In the graphs, the second syllable 'ba' is the key syllables, and the preceding syllables 'bi' and 'ba' provide the 
changing vowel contexts. Formant values are extracted from the onset, middle and offset points of the second 
syllable 'ba', which correspond to point A, B and C in the graphs. Comparison is made for formant values of the 
target values in the two contexts at each of the three points. Taken the onset point, point A, as an example, since 
the contexts shown in graph (a) and graph (b) are different, if the formant values at point A in the two contexts 
are significantly different, it can be concluded that there is an effect of coarticualtion at that point. In this study, 
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comparison will be made for both F[ and F 2 values at each of the three points, the onset, middle and the offset 
point. 




a. The graph of 'Biba' 




i - 



Time HI 

b. The graph of 'Baba' 
Fig. 1: Waveforms and formant contours of the key words 'Biba' (a) and 'Baba' (b) 

III. RESULT 

Figure 2 presents the mean F t (a and b) and F 2 (c and d) values of the target vowel /a/ for male (a and c) 
and female (b and d) speakers, in the contexts of /a/ and /i/, measured at the onset, middle and offset points. One 
of the aims of this study is to investigate the extent of the coarticulation, so formant is measured at three points, 
the onset, middle and the offset points of the target vowel. Analysis is given in the following subsections. 

A. Onset point 

Results from a repeated measures ANOVA show that, when key words occur at sentence medial 
position, at the onset point of the target vowel, with data of both male and female speakers pooled together, the 
effect of changing vowel is significant for F 2 , but not for F h F,: F(l, 71) = 0.04, p = 0.847; F 2 : F(l, 71) = 47.72, 
p < 0.001. That is, coarticulatory effect exists for F 2 , but not for Fj. 

B. Middle point 

At the middle point of the target vowel, it is shown that, similar to that at the onset point, the effect of 
changing vowel is significant for F 2 , but not for F,. F,: F(l, 71) = 0.39, p = 0.537; F 2 : F(l, 71) = 7.28, p = 0.009. 
Coarticulatory effect exists for F 2 , but not for F|. 



C. Offset point 

Coming to the offset point of the target vowel, it is shown that the effect of changing vowel is not 
significant for either F, or F 2 , F,: F(l, 71) = 3.82, p = 0.55; F 2 : F(l, 71) = 0.25, p = 0.621. Coarticulatory effect 
does not exist at that point. 




Onset 



Oflset 



a. Fi for male speakers 
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Middle 



Offiet 



b. Fj for female speakers 
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c. F 2 for male speakers 
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d. F 2 for female speakers 

Figure 2: F[ (a and b) and F 2 (c and d) values of the target vowel /a/ for male (a and c) and female (b 
and d) speakers, in the contexts of /a/ and /i/, measured at the onset, middle and offset points 



IV. DISCUSSION 

From the results reported in the previous section it is noted that, trans -segmental coarticulation does 
exist in Chinese, especially for the second formant value of the vowel. In this experiment, coarticulatory effect 
is examined at the onset, middle and offset points of the target vowel respectively, and both the first and the 
second formant values are investigated. It is found that, as far as carry-over coarticulation is concerned, 
coarticulatory effect exists for the second formant values at the onset and the middle points of the vowel. To be 
specific, at the onset and the middle points of the vowel, there are significant differences between the two 
changing vowel contexts for the second formant, but not the first formant. 

The effect on the first and the second formants are not consistent with each other at the onset and the 
middle point of the target vowel, with F 2 affected, while F[ unaffected. We speculate that the reason for this is 
that the difference between /a/ and /i/ for F 2 is larger than that for Fj. According to the report of Bao [24], the 
mean formant values by 8 male speakers are as follow, /a/: Fi = 984 Hz, F 2 = 1157 Hz; hi: Y x = 283 Hz, F 2 = 
2350 Hz. The differences between /a/ and l\l for F ; and F 2 are 701 Hz and 1193 Hz respectively. The difference 
of F 2 is much larger than that of Fi. When the formant difference is large, the force for triggering the change of 
course of formant contour is also large. That is to say, vowel coarticulatory effect is more likely to occur on 
cases with great formant difference, therefore, it occurs on F 2 , not on Fj. 

When the offset point of the target vowel is investigated, it is found that coarticulatory effect does not 
exist. That is, for either Fi or F 2 , there is no coarticulatory effect. This result is caused by the 'distance effect': at 
the onset and the middle points, the distance from the measured point to the changing vowels is close, and the 
effect exists; while at the offset point, the distance gets farther, and the effect disappears. 
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Magen [15] investigated the extent of vowel-to-vowel coarticulation in English trisyllabic utterances, 
and it was found that coarticulatory effects can, in some instances, extend beyond the bounds that previous 
research had assumed; coarticulatory effect can extend from one full vowel, through the medial schwa, and into 
the midpoint of the next full vowel. He proposed that foot does not define the domain over which coarticulatory 
effects operate. However, in the present study, it is found that coarticulatory effect does not extend to the end of 
the vowel. We speculate this is because that Chinese is of different language typology from English. 

One of the well-known properties of language typology is the rhythm unit of a language. Languages 
have been categorized as mora-timed, such as Japanese, stress-timed, like English and German, and syllable- 
timed, such as Chinese and French [25-26]. English is a stress-timed language, and the unstressed syllables are 
quite weak, so it is possible for coarticulatory effect to extend to the third syllable in English. Chinese is a 
syllable-timed language, in which syllables are rarely as weak as the unstressed ones in English. Compared to 
English, the degree of articulatory constraint (DAC) in Chinese is high. Therefore, the coarticulatory effect in 
Chinese is not as great as that in English. 

V. CONCLUSION 

In this study, the vowel-to-vowel coarticulation effect in bi-syllabic words in Chinese is analyzed, and 
it is found that trans-segmental coarticulation exists in Chinese, especially for the second formant value of the 
vowel. Coarticulation is more likely to occur on F 2 because the difference between /a/ and /i/ of F 2 is larger than 
that of Fi. Because of the 'distance effect', coarticulatory effect exists at the onset and middle points, and 
disappeared at the offset point of the target vowel. In Chinese, coarticulatory effect is not as great as that in 
English, as the two languages are of different prosodic typology. 

This study is significant for speech engineering. In speech synthesis, the effect of trans-segmental 
coarticulation must be taken into consideration, especially for the second formant value of the vowel. The first 
formant is not affected, so it is not necessary to mind about the effect on F[. Nor is it necessary to consider the 
trans-syllabic coarticulatory effect at the end of the vowel, as in Chinese, coarticulatory effect does not extend to 
the end of the syllable. Therefore, this study is helpful for speech engineering technology. 
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